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Abstract 

We present a technique of proving lower bounds for noisy computations. This is achieved by a 
theorem connecting computations on a kind of randomized decision trees and sampling based algorithms. 
This approach is surprisingly powerful, and applicable to several models of computation previously 
studied. 

As a first illustration we show how all the results of Evans and Pippenger (SIAM J. Computing, 
1999) for noisy decision trees, some of which were derived using Fourier analysis, follow immediately 
if we consider the sampling-based algorithms that naturally arise from these decision trees. 

Next, we show a tight lower bound of fl(TVloglog TV) on the number of transmissions required 
to compute several functions (including the parity function and the majority function) in a network 
of TV randomly placed sensors, communicating using local transmissions, and operating with power 
near the connectivity threshold. This result considerably simplifies and strengthens an earlier result 
of Dutta, Kanoria Manjunath and Radhakrishnan (SODA 08) that such networks cannot compute the 
parity function reliably with significantly fewer than TV log log TV transmissions. The lower bound for 
parity shown earlier made use of special properties of the parity function and is inapplicable, e.g., to 
the majority function. In this paper, we use our approach to develop an interesting connection between 
computation of boolean functions on noisy networks that make few transmissionss, and algorithms that 
work by sampling only a part of the input. It is straightforward to verify that such sampling-based 
algorithms cannot compute the majority function. 


1 Introduction 

We present a novel technique for analyzing randomized decision trees. This method does not depend upon 
the specfic function being computed by the decision tree and can be applied for proving lower bounds in 
various models for a variety of functions. 

We introduce the technique in the simplistic setting of e-noisy decision trees, e-noisy decision trees can 
be viewed as a simple kind of randomized decision trees. As an application of our technique, we show 
how it provides elementary and unified proofs of all the lower bounds of Evans and Pippenger 1EP991 for 
average noisy decision tree complexity of several types of functions. Their work introduced the notion of 
noisy leaf complexity, which was analyzed using Fourier methods. However, as we show, our technique 
yields elementary arguments that places these results in a compact and unified framework. 

*A preliminary version of this work appeared in the Proceedings of the 49th Annual IEEE Symposium on Foundations of 
Computer Science, 2008, pp. 394-402. 
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We then use our technique to derive the main result of this paper - a lower bound for wireless sensor 
networks. This simplifies and extends a lower bound of llDKmR081 IDKmR 15 1 for this model. A wireless 
sensor network consists of sensors that collect and cooperatively process data in order to compute some 
global function. The sensors interact with each other by transmitting wireless messages based on some 
protocol. The protocol is required to tolerate errors in transmissions since wireless messages typically are 
noisy. 

In the problem we study, there are n sensors, each with a boolean input, they are are required to coop¬ 
eratively compute some function of their inputs. The difficulty of this task, of course, depends on the noise 
and the connectivity of the network. In this paper, we assume that each bit sent is flipped (independently for 
each receiver) with probability e > 0 during transmission. As for connectivity, we adopt the widely used 
model of random planar networks. Here the sensors are assumed to be randomly placed in a unit square. 
Then each transmission is assumed to be received (with noise) by the sensors that are within some prescribed 
radius of the sender. The radius is determined by the amount of power used by the sensors, and naturally one 
wishes to keep the power used as low as possible, perhaps just enough to ensure that the entire network is 

connected. It has been shown by Gupta and Kumar itGKOOl that the threshold of connectivity is 9 

(with a radius much smaller than this the network will not be connected almost surely, and with radius much 
larger it will be connected almost surely). 

It was shown by Dutta, Kanoria, Manjunath and Radhakrishnan IlDKmROSl lDKmRT5l . that computing 
the parity of the inputs requires f l(n log log n) transmissions. This result showed that the protocol presented 
by Ying, Srikant and Dullerud liYSD061 for computing the sum of all the bits (and hence any symmetric 
functions of these bits) is optimal. The lower bound argument in IDKmR08l IDKmRl5l depended strongly 
on the properties of the parity function. In particular, the argument was not applicable for showing su- 
perlinear lower bounds for the majority and other symmetric functions in this model. Using the technique 
presented in this paper, we can now give optimal lower bounds for several functions, including the majority 
function. 

The starting point of our method is the connection between noisy wireless network protocols, and certain 
randomized decision trees. This connection was derived in |[DKmR08llDKmRT5ll . and it in turn made crucial 
use of a result in llGKS05ll . The lower bound for parity in liDKmR08lIDKmR~l5l was derived by rearranging 
the randomized decision trees obtained from wireless protocols computing parity. In this work, we show 
that computations on decision trees that arise in our context can be simulated by randomized algorithms that 
leave a non-trivial fraction of then - inputs unread. Once this is established, it is relatively straightforward to 
conclude that several functions cannot be computed in this model. 

In order to state our result formally we need a formal definition of the model of noisy communication 
networks, which we now reproduce from liDKmR08l . 

Definition 1 (Noisy communication network, protocol). A communication network is an undirected graph 
G whose vertices correspond to processors and edges correspond to communication links. A message sent 
by a processor is received by all its neighbors. 

Noise: In an e-noise network, the messages are subjected to noise as follows. Suppose processor v sends 
bit b in time step t. Each neighbor of v then receives an independent noisy version ofb; that is, the 
neighbor w of v receives the bit b © r] w p where r] Wt t is an e-noisy bit (that takes the value 1 with 
probability e and 0 with probability 1 — e), these noisy bits being mutually independent. 

Input: An input to the network is an assignment of bits to the processors, and is formally an element of 

{ 0 , 1 }^°). 
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Protocol: A protocol on G for computing a function f : {0, l\ v T- : ) —>• {0,1} works as follows. The proces¬ 
sors take turns to send single bit messages, which are received only by the neighbors of the sender. In 
the end, a designated processor v* E V(G) declares the answer. The cost of the protocol is the total 
number of bits transmitted. A message sent by a processor is a function of the bits that it possesses 
until then. The protocol with cost T is thus specified by a sequence of vertices (v\, V 2 , ■ ■ ■ , vt) and 
a sequence of T functions (<7i, <72> • ■ ■ > 9t}> where gt : {0, l} 3t —>• {0,1} and jt is the number of bits 
received by vt before time step t {plus one ifvt is an input processor). Furthermore, vt = v*, and the 
final answer is obtained by computing g^. Note that in our model the number of transmissions is the 
same for all inputs. 

Error: Such a protocol is said to be a 5-error protocol, if for all inputs x E {0, 1}' i(;> , Pr[output = 
f{x)\ >1-6. 

In this paper, we consider networks that arise out of random placement of processors in the unit square. 

Definition 2 (Random planar network). A random planar network M{N, R) is a random variable whose 
values are undirected graphs. The distribution of the random variable depends on two parameters: N, the 
number of vertices, and R, the transmission radius. The vertex set of Af(N, R) is {Pi, P 2 , . ■ ■, P/v}. The 
edges are determined as follows. First, these processors are independently placed at random, uniformly in 
the unit square [0, l] 2 . Then, 


E(Af) = {(Pi,Pj) : dist (P u Pj) < R}. 


Our main result is the following. 

Theorem 3 (Lower bound for majority). Let R < N~ 3. Let 5, e E (0, i). Then, with probability 1 — 
o(l) (over the placement of processors) every 5-error protocol on Af{N, R) with e-noise for computing the 
majority on N bits requires Ll(N log log jY) transmissions. 

Remarks: 

• It was conjectured in HDKmR08l that one cannot approximate the sum to within an additive error of 
N a (for some a > 0) using O(N) transmissions. Using the techniques of this paper, we can prove 
this conjecture (details omitted from this paper). 

• Typical protocols in wireless networks operate by computing using broadcasts in cells, where there 
is complete connectivity between the processors, and then aggregating the information across the cells 

(e.g. ||KM07 1. I IYSD061 1). Here it makes sense to consider functions of the form f(g(X\), g{Xf ),..., ry(X/.j), 
where X{ is the part of the input that falls in cell 1 , f is some symmetric boolean function and g is 
some function to be computed inside each cell. For example, if / and g are both parity, then this 
corresponds to the parity function on the entire input. Our techniques show that for most symmet¬ 
ric functions, and all g with high sensitivity, one requires Q(N log log N) transmissions. (Details 
omitted.) 

As stated earlier, our technique yields simple proofs of previous lower bound results on noisy decision 
trees. 
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Definition 4 (Noisy decision trees). A boolean decision tree T for input x = (< x\,..., x n ) € {0, \} n is 
a binary tree in which each internal node v has a label l(y ) € [1, n] and each leaf £ has a value val (£) € 

{0,1}. The two outgoing edges of each internal node are labelled by the value 0 and 1. The computation 
ofT on input x is the unique path starting at the root of the tree and continuing up to a leaf as follows: 
at internal node v, the outgoing edge labelled with xu v \ is chosen to get to the next node. The result of 
the computation is va I {£) where £ is the leaf reached by the computation. In an e-noisy boolean decision 
tree, at each internal node, the incorrect outgoing edge is chosen with probability e independent of former 
choices. Equivalently, each internal node v is assigned a binary random variable r) v that takes the value 1 
with probability e independently. Then, on reaching internal node v, the outgoing edge labelled xp v \ © r/„ is 
used to determine the next node. For each input x, the computation path and the value T(x) = £(x) output 
by the tree is a random variable. Let depth f r5 (/) be the minimum depth of an e-noisy decision tree T such 
that Pr \T(x) f f(x')} < 5, where the probability is over the input x chosen uniformly from {0,1}” and the 
internal randomness ofT. 

The main results of Evans and Pippenger I EP991 were (a) depth f a (Parity r J = Q(n(log n)/ log(l/e)), 

(b) depth £ 3 (/) = U(n(logn)/(og(l/e)) and for almost all functions; (c) depth £ 3 (/) = fl(nlogs) if / is 
n (l — i)-resilient. 

In Section [2 we provide a simple proof of the lower bounds. 

1.1 Related work 

Noisy broadcast models have been studied in the past where all sensors receive all messages (with inde¬ 
pendent noise). Gallager llGa!881 showed a remarkable protocol to collect all input bits at one sensor using 
0(N log log N) transmissions. Clearly, this give the same upper bound for computing any function of 
the input bits. Several other works have focussed on constructing protocols for specific functions in vari¬ 
ants of the noisy broadcast model, e.g., Feige and Raghavan I lFKOO ll. Newman I New04i l. Kushilevitz and 
Mansour I KM9811 . and Goyal, Kindler and Saks IIGKS051 . Using an insightful combination of information- 
theoretic and fourier based methods, Goyal, Kindler and Saks HGKS051 showed that Gallager’s protocol was 
the best possible for collecting all the bits. 

In sensor networks, considerations of power impose stringent limits on the transmission radius. In this 
paper, we study networks arising from random placement of sensors with transmission radius around the 
threshold required to ensure connectivity. As mentioned above, in this model Ying, Srikant and Dullerud ttYSD06t 
devised a protocol for computing the sum using 0(N log log N) transmissions. Kanoria and Manjunath HKM07I 
showed a protocol with O(N) transmissions to compute the OR function. Making crucial use of a result of 
Goyal, Kindler and Saks HGKS051 . it was shown in liDKmR08l lDKmR151 that computing parity requires 
kl(N log log N) transmissions which was then extended to several other functions in the initial presentation 
of this workl lDROSI . 

Unlike in the model of noisy wireless networks, several lower bounds results have appeared in the 
literature on noisy decision trees. Reischuk and Schmeltz HRS911 showed that almost all boolean functions 
of N arguments require Q(A r log N) queries. Feige, Peleg, Raghavan, and Upfal I FRPU94 1 showed an 
Q(N log N) lower bound for the parity function. Evans and Pippenger IIEP9911 presented arguments to show 
that these results also hold in the average case. 
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1.2 Techniques 

We now present an overview of the proof technique used to derive the lower bounds in this paper. A very 
detailed discussion of all the techniques can be found in the Phd thesis ltDut09l . 

The proof of our main result,like the proof in llDKmR08l . first converts computations on noisy broadcast 
networks to computations on randomized decision trees. 

Definition 5 (Randomized decision tree). A randomized decision tree is a model for processing inputs in 
{0, 1 For an internal node v of the tree let vj_, be its left child and vr its right child. Each internal 
node v of the tree is labelled by a pair { i v ,g v ), where i v E { 1 , 2 ,..., n}, and g v : {0, 1 } —>• {vl, vr} is 
a noisy function, whose output depends on its input and some internal randomness that is independent for 
different noisy computations performed in the tree. Once an input x = (x \ , a ;2 ,..., x n ) E {0, l} n is fixed, 
the (random) output of the tree is determined by the following natural computation. We start at the root, and 
when we arrive at an internal node v, we determine the next vertex by evaluating g v (x,; u ). The (random) 
output of the tree on input x E {0, l} n is the 0-1 label of leaf reached. 

Our lower bound for noisy broadcast networks will follow from a lower bound we show for randomized 
decision trees that arise from them. A central notion in our analysis, is the amount of uncertainty about each 
variable that remains at the end of the computation. We will use the notion of overlap between distributions 
to quantify this uncertainty. 

Definition 6 (Overlap between distributions). Let Dq and I)\ be distributions on some set L. The overlap 
between Dq and D\ is given by overlap(Zlo, Df) = min{.Do(f?), Di(£)}. Note that this quantity is 

directly related to the l\ distance between Dq and D\: ||.Do>-^i||i = 2(1 — overlap(Ho, Dfij). 

Definition 7 (Uncertainty). Let T be a randomized decision tree for single bit inputs, where each internal 
node computes a function based x E {0,1}. Let Dq be the distribution on T’s leaves when x = 0, and let 
D\ be the distribution when x = 1. Then, the the uncertainty of T about x, denoted by 3{T ), is given my 
overlap) I)( h D\). We will generalize this notion to trees with inputs in {0, l} fc . The uncertainty about x r is 

given by 3i(T) = min 7 -/ overlap(7 _/ ). where the minimum is taken over all trees T' obtained from T by 

(a) retaining the functions at nodes that query x, and 

(b) replacing the functions at nodes that query variables Xj (j f i) by arbitrary constant functions; 
Finally, define /3(T) = min* fifiT). 

For trees with multiple inputs, fifiT) denotes the uncertainty that remains about x, no matter how the 
decisions are made at nodes that query other variables. We can now state the following crucial connection 
between broadcast protocols and randomized decision trees which follows from arguments in flDKmR08l . 

Proposition 8 . If there is a broadcast protocol with Nd broadcasts for computing a the majority of N 
bits, then there is a randomized decision tree ‘T for computing the majority of N' > y/~N bits such that 
fi{T) > exp(— exp(0(d))). In particular, ifd< ^ log log N for some constant C, then /3(T) > N~ To. 

The main contribution of this work is a technique for analyzing randomized decision trees with non¬ 
trivial uncertainty. Roughly, we show that if the uncertainty about a variable is non-trivial, then the com¬ 
putation can be performed by leaving the variable unread with some non-trivial probability. For example, 
consider a trivial tree with one root and two leaves. At the root we read a variable x E {0,1}, and moves to 
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the left child with probability | + (— l) x e. This tree can be simulated as follows. With probability 1 — 2e, 
we do not read x at all, and move left or right with equal probability. With probability 2e we read x and 
move left if x = 0 and move right otherwise. If all nodes are of this kind and the tree has small depth, then 
it is not hard to see that we can simulate its computation leaving several variables unread. This is precisely 
the situation in the model of decision trees studied by Evans and Pippenger llEP99 1, which explains why 
our technique is effective there. However, applying this idea to the randomized decision trees guaranteed by 
Proposition [8]requires more careful analysis. The detailed argument is presented in Section [3j 


2 The Evans-Pippenger lower bounds revisited 


In this section, we show that the three average case lower bounds of Evans and Pippenger HEP99H follow 
immediately by considering sampling-based algorithms that arise naturally from noisy decision trees. 


Definition 9 ((r, <5)-sampling algorithm). We say that a randomized algorithm A is an (r, 5)-sampling al¬ 
gorithm if Py\A leaves at least r variables unread] >1 — 5. 


Definition 10 (Robust function). We say that f : {0, l} n —» {0,1} is an (r, 7 } -robust function, if for every 
subcube L of dimension r, 


1 

2 


7 < Pr [f(x) 




Once these definitions are in place, the proofs the results of Evans and Pippenger HEP991 follow easily 
from the following observations. 


1 . e-noisy boolean frees of small depth can be simulated by randomized algorithms that typically leave 
many variables unread. 

Lemma 11. Suppose T is an e-noisy boolean decision tree with n variables and depth at most kn. 
Then there is a randomized algorithm A that on all inputs simulates the computation on T (producing 

2k 2k 

the same distribution on the leaves), and with probability at least 1 — exp(— leaves at least '—u 

variables unread. 

This lemma is the key to the analysis in this section. We present its elementary proof below. 

2. Our next observation states that randomized algorithms of the kind promised by the above theorem 
cannot compute a robust function with small error. 

Lemma 12. If A is an (r, 5)-sampling algorithm computing an (r. 7 ) -robust function f. Then, 

Pr [A(x) = f(x)] < \ + 5 + 7. 

i€{0,l} rl 2 

3. The three types of functions considered by Evans and Pippenger are robust. 

Lemma 13. (a) Almost all functions f : {0, l} n —>• {0,1} are (6 log log n, o(l))-robust. 

(b) The parity function is (1, 0) -robust. 

(c) A t-resilient function is (n — t, 0 )-robust. 
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From LemmafTTlit follows immediately that any e-noisy decision tree of depth at most §“^^ 77^1 can 
be simulated using an (r, |)-sampling algorithm. Then, from Lemma [T2l it follows that any such tree for 
an (r, hj-robust function makes error at least \ | + 5. By combining this with Lemma[[3l we obtain the 

following. 

Theorem 14 (Evans and Pippenger IIEP9911 ). 1. depth e 3 (/) = U(n(logn)/log(l/e)); for almost all 

functions; 

2. depth 3 (Parity n ) = U(?i(log n)/ log(l/e)); 

’ 4 

3. depth £ 3 (/) = fl(nlogs) if f is n (l — -resilient. 

It remains to prove the lemmas claimed above. 

Proof of Lemma\]l\ The randomized algorithm works by simulating the computation by the noisy boolean 
decision tree stalling at the root. The algorithm has a boolean random variable B v for each internal node v 
of the decision tree. Each B v takes the value 1 with probability 2e independently. At internal node v of the 
tree, if B v = 1, the sampling algorithm chooses one of the outgoing edges with probability half each, to get 
to the next node. If B v = 0, the algorithm reads the value of the input variable xu v \ (without any error), and 
chooses the outgoing edge with that label. On reaching leaf £(x), which is a random variable, the algorithm 
outputs val(£(x)) as the result. It is easy to see that for any input x, the distribution on leaves reached by the 
sampling algorithm is exactly the same as that reached by the noisy decision tree. 

Fix an input x and a leaf £ reached by the simulation. We will show that conditioned on arriving at 

2k 

this leaf, the algorithm leaves at least '—n variables unread with high probability. Suppose the variable x t 
appears Ay times in T on the path from the root to l. Then, the probability that xy is not read conditioned on 

the computation reaching i is at least (a) ‘ £ 4. Since the depth of the tree is at most nk, there are at 
least 7 ) variables that appear at most 2k times on the path to £. Each of these variables is independently left 
unread with probability at least e 2k . Using the Chernoff bound, with probability at least 1 — exp(—^p), 

2k 

the algorithm leaves at least variables unread. Since this claim is true conditioned on each leaf, it also 
holds overall. □ 

Proof of LemmaU2\ The probability that fewer than r variables are left unread is at most 5. Conditioned on 
the algorithm leaving r variables unread, the probability that its output is correct is at most ^ + 7 because / 
is (r, 7) -robust. □ 

Proof of Lemma [73] The second and third claims follow immediately from definitions. We justify the first 
claim using the following routine calculation. Consider a d-dimensional subcube of the boolean hypercube 
{0, l} ri . Pick a random function /, and let X be the random variable denoting the number of points in 
the subcube where / takes the value 1. We have E[X] = 2 d /2. Let t = ^ 3 . By the Chernoff bound, 
Pr[|X — E[X]\ > t2 d ] < 2exp(— t 2 2 d ) = 2exp(—2 d / 3 ). Taking the union bound over all subcubes of 
dimension d, the probability that / has a bias of more than t = 7^3 on any such subcube is at most 
(^) (2exp(—2 d / 3 )). Thus, for d > 6 log log n, with probability 1 — o(l), / has o(l) bias on every subcube 
of dimension d. □ 
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3 Proof of main Theorem 


In this section we prove our lower bound on the number of transmissions needed to compute majority in a 
noisy wireless network. By Proposition [U it is enough to show that randomized decision trees with high 
uncertainty cannot compute majority. We will first show how such trees can be simulated by sampling 
algorithms. The result follows from this because it is straightforward to verify that sampling algorithms that 
leave a super-constant number of variables unread cannot compute majority with low error. 

Definition 15 (Sampling-based algorithm). By a sampling-based algorithm for computing a function f : 
{0, l} n —>• {0,1}, we mean an algorithm of the following kind. In the sampling phase, the algorithm 

uses n sampling probabilities q \ ■(]■> . (in E {0,1}. Given an input x = (x\. x- 2 -..., x n ), the algorithm 

constructs a string y E {0,1. *}" from x, by independently replacing X{ by a * with probability 1 — <p. In 
the second phase, it declares its guess for f(x ) based on y alone. 

The main part of the argument is contained in the following theorem. 

Theorem 16. Let T be a randomized decision tree with inputs from {0,1 } k that computes a function f with 
error at most 5. Then, there is a sampling algorithms A which independently samples the i-th variable with 
probability qi = 1 — fl(T), and computes f with error at most 5. 

Preliminaries. First, we need some notation. For an internal node v of the tree, let vl denote its left 
child, and vr its right child. Suppose the input x, is queried at a v. The tree T specifies the probabilities 
for the computation to move to each child for each possible value of x t . For the node v, let Xv be the 
event that the computation on T reaches the node v. Clearly, \ v is the intersection of independent events 
{Xv,i ■ i = 1,2,..., k}, where Xv,i i s the event that the computation reaches node v assuming and the 
choices at all nodes not labelled i don’t leave the path. Note that the probability of Xv,i depends only on the 
value of xp, let p v .j(z) be this probability. Then, the probability of the computation reaching the node v on 
input x = (x l ,..., x k ) is precisely flLi Pv,i{xf). 

Before formally stating the proof of the theorem, it will be useful to present a natural method for ‘com¬ 
puting’ Pi(T). This method works bottom up, assigning a value B v y to the node v of the tree. It will turn 

out that /3 roo t,i = fl(T). The intermediate values 8 v , r produced in this algorithm, will be used crucially 
when our final sampling algorithms simulates the computation of T. In fact, 8 VJ has the following natural 
interpretation. Consider a tree T' as in the definition of 8 % (T). Let Dq and I)\ be the distributions on the 
leaves of T' when xq is set to 0 and 1. The minimum overlap (over all possible such T') between Dq and I)\ 
when restricted to the leaves in the subtree rooted at v, is the quantity /3 v y. With this interpretation, consider 
the following computation. 

• for a leaf v, let f3 Vji = min{p„ i j(0),p^ i i(l)}. 

• for an internal node v with children vl and vr, where Xj is queried, 

o _ f min {Pv L ,i, Pv R ,i} if i ^ 3 

V ’ J \ Pv L ,i + Pva.i if i = j 

The following claim, which we state without a formal proof, is now immediate. 

Proposition 17. For ?' = 1, 2,... , k we have 

float,* = fl(T) 

fl,i < mm{p Vti (0),p Vt i(l)}. 


Proof. Our goal is to simulate the computation of this tree using a randomized sampling algorithm, where 
input Xi is read independently with probability 1 — ff(T). We want to ensure that for every input the leaf 
reached in the end of this simulation has the same distribution as in the original tree T. To specify how this 
simulation is to be performed, we need to determine the following. 

For each internal node of T. we need the transition probabilities for moving to each child when 
the input for that node is available in the sample, and when it is not. For this, we will specify 
for each internal node v, a function £ v : {0,1} -» [0,1] and a value a v € [0,1], which are 
to be used as follows. When the computation reaches node v, where Xi is to be read and the 
value of Xi is available, then the next node is vr with probability £ v {xf) and vr with probability 
1 — £ v (xi); if Xi is not available then the next node is vr with probability a v , and vr with 
probability 1 — a v . 

Once £ v and a v have been specified, we may consider the events \v and Xv,i as before. Note that the 
events in {xv,i '■ * = 1,2,..., k} are independent and their intersection is precisely Xv Let p v (x) be the 
probability of \v in the simulation for input x, and similarly let p v ,i(z) be the probability of the event Xv,i 
when Xi = z (note that the probability of Xv,i depends only on x,). Clearly, p v (x) = nf =1 p v ,i{xi), and to 
show that our simulation is faithful to the original computation, it will suffice to verify that p v/l = p VA for 
all i. The rest of the proof consists of two steps. 

Step 1: Using the values /3 Vj i defined above, define £ v and a v . 

Step 2: Show that for i G {1,2,..,, k} and each node v € T and z € {0,1}, we have p v ,i(z) = p v ,i(z). 

We now implement this two-step plan. Consider the first step. Recall the values of /3 V ^ defined above using 
a bottom up computation on the tree T. We can now define a v right away based on the f3 v f s computed 
above. If v has label i, then 


Now, consider an internal node v. Let vi,V2, ■ ■ ■ ,v r = v he a path in the free from the root v\ to the node 
v. Let p Vj i denote the probability that the computation reaches node v and is not sampled, assuming that 
the choices at nodes not labelled i do not cause the computation to leave the path. Thus, 

r —1 

fiv,i — P ~ '^i] — firoot,i 11 'Yji 

3 =1 


where E % is the event “x t is sampled,” and 


7i 


1 if Xi is not queried at Vj 

a Vj if Vj +1 is the left child of Vj 

1 — a vj+\ if v j +1 i s ih e right child of Vj 


We will show that the following choice for £ v ensures that the probability of reaching every node is preserved 
in our simulation: 

£ ^ _ PvL,i ( z ) ~ PvlA 
Pv,i{p) Pv,i 

This completes Step 1. 
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Now, we move to Step 2 and verify that these definitions ensure that p Vj i(z) = p v ,i(z). Clearly, the claim 
is true for the root, for both quantities are 1. Suppose the claim is true for a node v. We will now show that 
it is true of vl and v /,> as well. Consider, vj J . We have 

Pv L ,ii z ) = Fl 'lXv L ,i A £i\ + Pr [xv Ll i A -<£*], 

where the probabilities are computed assuming that Xi = z. Using our assumption that the claim holds for 
v, we can compute the first term as 


Pr [Xv,i A Si] ■ l v (z) = {p v ,i(z) ~ Pv,i) ■ PvL)l ^ —= p VL A z ) ~ Pv L ,i- 

PvA z ) ~ Pv,i 

By definition, the second term is precisely $ VL ,i- It follows that p VL , t (z) = p VL; j(z), and the claim holds for 
vl- A similar calculation shows that the claim holds for vr as well. This completes Step 2. 

Thus, the simulation induces the same distribution on the leaves of T as the original computation, and 
therefore computes / with the same probability of error. □ 

The following proposition states that a sampling algorithm that leaves many variables unread cannot 
compute majority reliably. 

Proposition 18. Suppose N = 2 k + 1 is odd. Let Abe a sampling algorithm with inputs from {0,1 } ,v , 
which leaves each variable unread with probability N~&. Let X be uniformly distributed on all strings with 
k or k + 1 ones. Then, Pr[y4(X) = Parity(X)] < \ + o(l). 

Proof. (Sketch.) By the Chernoff bound, with probability 1 — o(l), the size of the sample picked by A is 
N — f)(N A A'y \y e ma y assume that A bases its decision only on the number of l’s in the sample. Let Dq be 
the distribution of the number of l’s in the sample conditioned on the number of l’s in X being k and let D\ 
be the corresponding distribution conditioned on the number of l’s in X being k + 1. A direct computation 
shows that the relative entropy 


S(A)pt) = I>o« lo gg|y = 0 (l). 

It follows that i\ distance between 1)q and D\ is o(l). Our claim follows from this. □ 

Proof of main theorem. Proposition [8] guarantees that if we have a protocol for computing majority with 
constant error that uses less than ^ N log log N transmissions for some constant C. then we have a random¬ 
ized decision tree to compute majority of s/N bits with constant error and AT) > N~ To. Theorem [T6l 
then guarantees we have a sampling algorithm A that samples every variable with probability 1 — N~ To, 
and yet manages to compute the majority of s/N bits with constant error. But this is impossible by Propo¬ 
sition [T8j □ 

4 Conclusions 

In this paper, we presented a technique of converting computation on randomized decision tree model to 
computation on a model of sampling algorithms. We related the uncertainty of an input variable in the 
randomized decision tree model to the probability that the variable is left unread by the sampling algorithm. 
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We showed the power of this technique for proving lower bounds by providing elementary arguments 
to prove all the lower bounds on average noisy decision tree complexity for computing various functions 
presented by Evans and Pippenger IEP991 . 

Using our technique, we then presented lower bounds for wireless communication networks where there 
is a restriction on transmission power. Any bit sent by a transmitter is received (with channel noise) only by 
receivers which are within the transmission radius of the transmitter. We showed that to compute the parity 
and majority function of N input bits with constant probability of error, we need Ll(N log log N) trans¬ 
missions. This result simplifies and extends the same earlier lower bound for parity liDKmR08l IDKmR15l 
and nicely complements the upper bound result of Ying, Srikant and Dullerud I IYSD06I . which showed that 
0(N log log N) transmissions are sufficient for computing the sum of all the N bits. Our result also implies 
that the sum of N bits cannot be approximated up to a constant additive error by any constant error protocol 
for J\f(N, R ) using o(N log log N) transmissions, if R < for some (3 > 0. 
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